III.  Log Salary Regression Approach
Discussion of Regression Approach

Regression analysis is used here for modeling the relationship between
the dependent variable {log salary) and the set of predictor variables
called salary-relevant characteristics (see appendix A).  The
regression models proposed for this study assume that log salary is a
linear function of the salary-relevant characteristic, plus a random
error. Regression analysis overcomes the weakness of the one-variable-
at-a-time approach of the previous section by studying log salary for
the joint set of salary-relevant characteristics.

The first step in the regression analysis was to develop separate
regression models for male graduates and female graduates which fit the
observed data.^  Separate models were created, rather than one with
sex as a variable, to permit an examination of the determinants of
salary for each sex.  Many models were examined before the final models
were selected.  All of the salary-relevant characteristics available
were used in those exploratory models. The final models were chosen
because they exhibited the best fit to the data (highest proportion of
variance accounted for in log salary) with the fewest possible
salary-relevant characteristics. The fit of the models was judged by
the coefficient of determination (R2).

Both male and female models had an R2 of approximately 0.50.2
They shared the following predictor variables (each term exceeded the
0.01 level of significance): degree level, years of experience, squared
years of experience, major field/job-relatedness, industry grouping,
metropolitan status and occupation.  In the male model, marital status
and enrollment status accounted for a significant proportion of the
variability in log salary.  In the female model, college selectivity
and region accounted for a significant proportion of log salary.  These
models are presented in subsequent sections.

After the regression models were established, it was possible to
analyze the determinants of male and female log salaries by examining
the relative effect on salary of each of the predictor variables in the
model.  This was accomplished by establishing an arbitrary reference
group (graduates who shared membership in the largest category of each
predictor variable) and noting the predicted salary of this group.  One
characteristic of the reference group was then changed, and the new
predicted salary was noted.  The percentage change from the first
salary to the second showed the relative and isolated effect of this
one characteristic on the salary of the reference group.

^Regression coefficients and their standard errors for these models

are found in appendix P.
2Under cross-validation, using the same regression equation, R2

would be expected to be lower.ee Beck et al.   1978a,   197flb;  Stolaenberg 1975;  Mincer 1974).     The use
